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THE CORRELATION BETWEEN LENGTH OF FLOWERING-STALK 

AND NUMBER OF FLOWERS PER INFLORESCENCE IN 

NOTHOSCORDUM AND ALLIUM. 



BY 3. ARTHUR HARRIS. 



Several years ago DeVries in outlining some of the prob- 
lems of chief importance for future investigation in evolu- 
tionary work, designated correlation as one of the most 
important.* In another place in the same volume he dis- 
cusses data of his own for the relationship between the 
diameter of the stem and the length of the fruit in certain 
of his evening primroses, f 

Notwithstanding DeVries's emphasis of the importance of 
quantitative studies of correlation, comparatively little has 
been done by botanists. His suggestion concerning a rela- 
tionship between the size of the individual and the measur- 
able characters of the fruit called my attention to the 
desirability of a series of comparative studies of the inter- 
relationship between the vegetative development of organs 
of the individual and the number or fertility of the flowers 
or fruits which it produces. 

As material for a first determination of the interdepend- 
ence of length of flowering-stalk and number of flowers per 
inflorescence I shall use series of Nothoscordum and Allium. 
Data from the inflorescences of several other species are 
being examined. 

In the spring of 1906, a collection of the flowering-stalks 
of Nothoscordum striatum was made just south of the railway 
station at Meramec Highlands, near St. Louis, Mo. Here the 
brow of the hill slopes in two directions, to the southeast 
and to the southwest. The ground was more or less covered 
with old dead or leafless vegetation consisting chiefly of 
sumach bushes, large tufts of grass and weeds, and low grass. 
While no quantitative determination was made, it seemed to 
me that the old vegetation on the southwest slope was much 

* DeVries, H. Die Mutationstheorie. 1 : 113. 1901. 
t DeVries, I. c. 381-383. 
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more diversified from quadrat to quadrat in size and quan- 
tity than that upon the southeast slope. These remarks 
apply merely to the localities from which the samples of 
material were taken. 

It occurred to me that a consequence of this condition 
might be a greater variability of the length of the stem in 
the series from the southwest slope. Here some of the plants 
grew up through large tufts of grass or dead weeds, while 
others occurred in open spots. Some were close to shrubs of 
considerable size and others grew where the vegetation was 
much smaller. On the southeast slope, on the other hand, 
the grass was rather uniformly short and weeds and bushes 
very few. If variability in the individuals of a population 
be to a considerable extent due to diversity in their edaphic 
environment, as many writers have suggested, it would seem 
quite reasonable to expect a greater variability on the south- 
west slope where conditions were apparently more diverse 
than on the southeast slope. 



TABLE I. NOTHOSCORDUM. 
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CORRELATION IN FIRST COLLECTION, SOUTHWEST SLOPE. 

For the purpose of ascertaining whether the plants from 
these two spots, distant only a few yards from each other, 
do show differences in variability and correlation the material 
was collected in separate lots. That from the southwest 
slope comprised 359 flowering-stalks and the correlation be- 
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tween length of stalk and number of flowers per inflores- 
cence is shown in Table I, while that from the southeast 
slope numbered 426 plants whose characteristics are shown 
in Table II. 



TABLE II. NOTHOSCORDUM. 
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CORRELATION IN SECOND COLLECTION, SOUTHEAST SLOPE. 

In these tables the numbers in the left marginal column 
represent flowers per inflorescence while those along the top 
give the length of stalk to the nearest centimeter. 

A word must be said concerning the method of collecting 
the samples. To dig out the bulbs for so many individuals 
would have been an almost impossible task. By pulling 
steadily and gently on the stalk it breaks a considerable dis- 
tance under ground and at about the same point near the 
bulb. It is fairly easy to detect stalks which have broken 
sensibly above the usual point and such were discarded. 
This method of gathering the stalks is a source of some error. 
The means are necessarily slightly reduced below what they 
would have been could the bulbs have been dug out. Pos- 
sibly, too, the variabilities are slightly increased. But the 
purpose of this first investigation was merely to get some 
general idea of the degree of interdependence of length of 
peduncle and number of flowers per inflorescence, and for 
this purpose very precise measurements are superfluous. I 
think the error introduced by the method of collecting the 
plants is — for the purposes in hand — so slight that it may 
be neglected. At the time the material was gathered prac- 
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tically all the flowers had opened or were nearly ready for 
anthesis. All those with young buds were discarded to avoid 
the possibility of including stalks which had not completed 
their growth. These were very few in number and the 
samples taken may be regarded as representative of the 
species in this habitat. The rather coarse scale of centi- 
meter units was adopted partly because of the possible error 
in the collecting of the material referred to above and partly 
to obviate the apparently needless labor of dealing with 
much larger correlation tables. 

TABLE A. NOTHOSCORDUM. 



Organ and Source 
of Material 


Average and 
Probable Error 


Standard Devi- 
ation and Prob- 
able Error 


Coefficient of 
Variation 


Length of Stalk 
Southwest Slope 
Southeast Slope 
Difference 

Flowers per Inflor- 
escence 
Southwest Slope 
Southeast Slope 
Difference 


19. 407 ±.162 

17. 970 ±.115 

+ 1.437 ±.198 

6. 212 ±.056 

5. 423 ±.039 

+ .789±.068 


4.545±.114 

3.511±.081 
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19.541 

+ 3.880 
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22.435 

+ 3.070 



VARIATION CONSTANTS. 



The means, standard deviations and coefficients of vari- 
ation, and the probable errors of the first two constants, are 
given for length of stalk and number of flowers per inflor- 
escence in Table A. Sheppard's correction for the second 
moment was used for all the variabilities given in this paper. 

These constants require little discussion in this place. The 
means, standard deviations and coefficients of variation are 
all somewhat higher, and as the probable errors of the differ- 
ences show significantly higher, in the series from the south- 
west slope than in the other of the two lots . The empirical 
range is also wider, although N is smaller, in the collection 
from the southwest slope, where I had estimated that edaphic 
conditions were more variable than in that from the south- 
east slope where they had appeared to me more uniform. 
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But before any great stress is laid upon these results some 
real measure of the relative variability of the environmental 
conditions in different habitats should be secured. Until 
better data can be obtained these differences in the constants 
for two collections of material in habitats a stone's throw 
from each other may serve as a further warning against the 
collection of biometric material without due regard to envir- 
onmental conditions.* 

The coefficients of correlation are the constants which give 
the direct answer to the chief question in hand. They are: — 

Plants from southwest slope, r = . 549 ±.025 
Plants from southeast slope, r = . 490 ±.025 
Difference 059±.035 

These constants show at once that there is a very sub- 
stantial interdependence between the length of the flowering- 
stalk and the number of flowers per inflorescence. The two 
constants appear somewhat unlike to the eye, but when the 
probable error of their difference is taken into account it is 
clear that their divergence is not significant. 

To render the relationship between the length of the stem 
and the number of flowers per inflorescence more clear to 
those unacquainted with biometric terminology we may 
express the relationship in terms of regression instead of cor- 
relation. The equations to the regression straight lines are: — 

For the series from the southwest slope, 

i/ = 2.501 + .191x. 
For the series from the southeast slope, 

y=2.375 + .170a;. 

Here x denotes length of stalk and y number of flowers. 
From these regression coefficients we see that an increase of 
about two-tenths of a flower per inflorescence is on an aver- 
age associated with an increase of a centimeter in the length 
of the stalk. 

The relationship between the length of the stalk and the 
number of flowers per inflorescence may be represented 
graphically by the slope of the regression line, as in Diagram I. 

* For one of the first discussions of this subject see the papers on the 
sources of apparent polymorphism in plants in Biometrika. 1 : 304-319. 
1902. 
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DIAGRAM I. NOTHOSCORDUM. 
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SLOPE OF REGRESSION STRAIGHT LINES IN TWO COLLECTIONS. 



CORRELATIONS IN NOTHOSCORDUM AND ALLIUM. Ill 

The solid dots and the circles show the actually observed 
mean numbers of flowers per inflorescence for the several 
grades of stem length in the two series. The agreement 
between observation and theory is not very close, but N is 
not very large in either series and consequently the means 
of some of the arrays are based upon an inadequate number 
of observations. While the observed means do not fall so 
near the theoretical lines as one might like to see them, it 
seems quite idle to seek for a better equation than that for 
the straight line. Regression seems to be, so far as we can 
judge from the present data, linear. 

The material for Allium stellatum was taken in the fall of 
1907 from the hills at Meramec Highlands, Mo. The plants 
were not collected from such restricted habitats as those of 
Nothoscordum, but were drawn from a considerable range of 
territory, perhaps four square miles altogether. The method 
of gathering and measuring the stalks was the same as for 
Nothoscordum. In this case the range of variation is so 
wide that it is convenient to group in classes of three units 
for purposes of calculation. The data are presented in Table 
III; the calculated variation constants, in Table B. The 
coefficient of correlation between length of stalk and number 
of flowers per inflorescence is .619 ±.018. Here again the 
variation constants require no particular comment and are 
given merely for the convenience of the reader. The co- 
efficient of correlation is slightly higher than that for Nothos- 
cordum, but whether any significance is to be attached to 
the difference could only be determined by the collection of 
further series of material. 

The equation to the regression straight line is 

y- 15. 480+ 1.311 x, 

where x= length of stalk and y = number of flowers per in- 
florescence. The graph, Diagram II, shows that the empirical 
means fall very close to the theoretical line for all but the 
last three classes. Here there appears at first to be some 
indication of non-linear regression, but a glance at the cor- 
relation table shows that the means are based on only 4, 3, 
and 1 observations respectively and hence cannot be given 
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much weight. Under these circumstances I would rather 
devote time to the collection of additional data than to the 
mathematical analysis of those already on hand. Hence no 
further tests for linearity of regression are applied. 



TABLE III. ALLIUM. 
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TABLB B. ALLIUM. 



Average and Probable Error 
Standard Deviation and Prob- 
able Error 
Coefficient of Variation 



Length of Stalk 



39.491 ±.160 

5.446±.118 
13.791 



Number of Flowers 
per Inflorescence 



36.309 ±.340 

11.571 ±.249 
31.785 



VARIATION CONSTANTS. 



In both of these cases the regression seems to be linear, 
although in Nothoscordum the fit of the means to the theo- 
retical line is not very good, and in the case of Allium there 
is some question concerning the upper part of the range, 
where the observations are inadequate. Further tests of 
linearity might be applied, but it seems hardly worth while 
to do so at the present time. 
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The significance of linear regression in the present cases is 
two-fold. First, it shows that we are quite justified in using 
the coefficient of correlation to describe the degree of inter- 
dependence of length of flowering stem and number of flowers 
per inflorescence. Second, it shows our material to be in 

DIAGRAM II. ALLIUM. 
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SLOPE OF REGRESSION STRAIGHT LINE. 



this respect in agreement with the vast majority of organic 
relationships, f.or in most cases of organic correlation regres- 
sion is sensibly linear. 

In short the high positive values of r and the apparently 
linear regressions indicate that in this material the length of 
the stalk and the number of flowers per umbel are intimately 
associated and that the rate of change in the number of 
flowers as we pass from the shortest to the longest stalks 
remains practically the same throughout the entire range of 
variation of the latter character. 
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There are sources of danger in interpreting these coefficients 
of correlation biologically. We are dealing here with statis- 
tical constants which measure precisely the degree of simi- 
larity of the associated variables under consideration, but 
give no intimation whatever as to the cause of this simi- 
larity. The physiologist has long used the term correlation, 
and it must be borne in mind that the word has a somewhat 
different meaning in the terminology of quantitative biology. 
All that the coefficient of correlation does is to describe, not 
interpret, degrees of interdependence. In the present in- 
stance we are dealing with perennial plants, and it is not at 
all unreasonable to think that the age of the individuals 
may make considerable difference in the magnitude of both 
the length of the flowering-stalk and the number of flowers 
per inflorescence. In this case the correlation would be in- 
creased by an amount depending upon the age heterogeneity 
of the collections. There seems to be, however, no way of 
avoiding this difficulty in material of this kind gathered in 
nature. Comparison with plants of annual habit of growth 
might, perhaps, throw some light upon this question. 

In conclusion, the data presented show that there is a 
considerable degree of interdependence between the length 
of the flowering-stalk of these monocotyledons and the num- 
ber of flowers which they produce. It is not at all surpris- 
ing to find this relationship, and refined statistical methods 
would not have been required to demonstrate its existence 
in this particular case. To those who urge that it is pedantic 
to devote laborious researches to the demonstration of what 
is to be a priori expected, the reply is two-fold. First, in 
science, assumptions, however plausible on the surface, should 
never be accepted without actual demonstration on adequate 
material. The need of careful regard to this dictum is found 
in the group of problems we are considering here. In un- 
published data somewhat similar to these, for other species, 
I find results not at all in agreement with those set forth in 
the present paper, although there is a priori quite as much 
reason for expecting high positive correlations as in the spe- 
cies here considered. Second, we should not be content to 
merely demonstrate the existence of a relationship; we 
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should measure its intensity as well. When a sufficient 
number of such measurements are available, comparisons 
will permit broader generalizations and suggest deeper re- 
searches' than we can now foresee, or can ever hope to attain 
without strictly quantitative methods of investigation. 



